Dealing with Web Data: History and Look ahead
نویسندگان
چکیده
The high rate of change and the unprecedented scale of the Web pose enormous challenges to search engines who wish to provide the most up-to-date and highly relevant information to its users. The VLDB 2000 paper ”The Evolution of the Web and Implications for an Incremental Crawler” tried to address part of this challenge by collecting and analyzing the Web history data and by describing the architecture and the associated algorithms for an incremental Web crawler that can provide more up-to-date data to users in a timely manner. Experiments and theoretical analysis showed — surprisingly at the time — that a policy that allocates more resources to more frequently changing items does not necessarily lead to better performance. In this paper, we discuss what has happened in the 10 years since and talk about the challenges that lie head.
منابع مشابه
Accelerating Decoupled Look-ahead to Exploit Implicit Parallelism
Despite the proliferation of multi-core and multi-threaded architectures, exploiting implicit parallelism for a single semantic thread is still a crucial component in achieving high performance. While a canonical out-of-order engine can effectively uncover implicit parallelism in sequential programs, its effectiveness is often hindered by instruction and data supply imperfections (manifested as...
متن کاملSpeculative Parallelization in Decoupled Look-ahead Architectures
One well known approach to mitigate the impact of branch mispredictions and cache misses is to enable deep lookahead so as to overlap instruction and data supply with instruction processing. A continuous look-ahead process which uses separate thread of control on another hardware contexts is one such approach which we call decoupled look-ahead [1], [2]. However, in such look-ahead schemes, look...
متن کاملوب مرئی و نامرئی: تجزیه و تحلیل استفاده از محیط وب بر اساس مدل ایدهآل تیپ ماکس وبر
Using the Web has become ubiquitous and an indispensable part of scientists’ daily life. Although there are many studies dealing with the use of the Web, few studies have focused on how different user groups including scientists make use of visible and invisible parts of the Web for educational and research purposes. This article first introduces the visible and invisible parts of the Web, and ...
متن کاملInfluence of spatial ability in navigation: using look-ahead breadcrumbs on The Web
Spatial implications of the commonly used ‘navigation’ metaphor have lead many researchers to investigate the relation between individual differences and navigation. This study presents an exploratory survey on the influence of spatial ability, the most incisive aspect of individual difference for navigation, when people try to accomplish their goal in the information space. There are still dif...
متن کاملA look-ahead model for the elongation dynamics of transcription.
This article introduces a chemical kinetic model of the transcriptional elongation dynamics of RNA polymerase. The model's novel concept is a look-ahead feature, in which nucleotides bind reversibly to the DNA before being incorporated covalently into the nascent RNA chain. Analytical and computational methods for studying the behavior of the look-ahead model are introduced, and several approac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 3 شماره
صفحات -
تاریخ انتشار 2010